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Abstract 

We study a simple Markov chain, the switch chain, on the set of all perfect match¬ 
ings in a bipartite graph. This Markov chain was proposed by Diaconis, Graham and 
Holmes as a possible approach to a sampling problem arising in Statistics. We ask: for 
which classes of graphs is the Markov chain ergodic and for which is it rapidly mixing? 
We provide a precise answer to the ergodicity question and close bounds on the mix¬ 
ing question. We show for the first time that the mixing time of the switch chain is 
polynomial in the case of monotone graphs, a class that includes examples of interest 
in the statistical setting. 


1 Introduction 

Counting perfect matchings in a bipartite graph is an important computational problem: 
the permanent of 0-1 matrix. This, and the computationally equivalent problem of sampling 
matchings uniformly at random, has applications in Statistics and other areas. In [9], Dia¬ 
conis, Graham and Holmes considered the applications of the 0-1 permanent to Statistics, 
in particular where the 0-1 matrix has recognisable structure, which they called truncated 
or interval-restricted. 

The truncated 0-1 matrices are those for which the columns can be permuted to give the 
consecutive Vs property on rows. That is, no two I’s in any row are separated by one 
or more O’s. Diaconis, Graham and Holmes [9] considered “one-sided” truncation, where 
the consecutive I’s appear at the left of each row, and “two-sided” truncation, where the 
consecutive Is can appear anywhere in a row. For two-sided truncation, they considered two 
special cases. In the hrst, both the rows and columns can be permuted so that they have 
the consecutive I’s property. In the second, the rows and columns can be permuted so that 
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the consecutive I’s have a “staircase” presentation, the monotone case, which is of interest 
in statistical applications dU. 

Diaconis, Graham and Holmes [5] proposed a Markov chain for sampling perfect matchings 
in a bipartite graph, which we call the switch chain. They showed ergodicity of the chain 
for the truncated matrices considered in [9], and conjectured that it would converge rapidly. 
Computing the 0-1 permanent is a well-solved problem from a theoretical viewpoint. It 
is T^P-complete to compute exactly [36], but there is a polynomial time approximation 
algorithm [19]. However, the switch chain gives a simpler and more practical algorithm 
than that of [I9], making it worthy of consideration. Thus Diaconis, Graham and Holmes’s 
conjecture was subsequently studied in the PhD theses of Matthews [26] and Blumberg [3], 
and we will discuss their results below. 

A 0-1 matrix is equivalently the biadjacency matrix of a bipartite graph, and we will study 
the graphs corresponding to the matrices considered by Diaconis, Graham and Holmes [9]. 
We identify the largest hereditary graph class in which the switch chain is ergodic: chordal 
bipartite graphs. The graphs considered in [9] form an ascending sequence within this class. 
We examine the mixing time behaviour of the switch chain for graphs from these classes, 
extending the work of 0. a and [26]. We also discuss the difficulty of computing the 
permanent exactly in these classes, where Valiant’s result [36] does not necessarily imply 
T^P-completeness. 

In particular, we show for the hrst time that the mixing time of the switch chain is polynomial 
on monotone graphs. This is proved by a novel application of a simple combinatorial lemma, 
the solution to the so-called mountain climbing problem [la EQ 12H1 El EZ]. Though this 
lemma is well known, there appears to be no worst-case analysis of this problem in the 
literature. Therefore we provide such an analysis as an appendix. 

After this paper was written, we learned that Bhatnagar, Randall, Vazirani and Vigoda [3] 
had used a similar approach for a rather different problem. They analysed the Jerrum- 
Sinclair chain [18] for generating random dichromatic matchings in a graph that has its 
edges partitioned into two colour classes. 

For the necessary background information on Markov chains, see [IIEIE3]. For the relevant 
graph-theoretic background, see [1 m El EZ]- 

1.1 Notation and definitions 

Let N = {1,2,...} denote the natural numbers. If n G M, let [n] = (1, 2, . .., n} and, if 
ni, 77-2 e N, let [ni, 77 - 2 ] = {tti, 77i -M, ..., 772 }. 

We will also use the notation [n]' = {!', 2', ..., rr'} and [771,772]' = {77}, (771 -|- 1)', ..., 77 ' 2 }. 
Here the prime serves only to distinguish i from 7'. Ordering and arithmetic for [77]' elements 
follows that for [77]. Thus, for example, 1' < 2' and 1' -|- 2' = 3 '. 

A graph G = (V, E) is bipartite if its vertex set V = [m] U [77]' and there is no (undirected) 
edge {v,w) G E such that v,w E [m] or v,w E [77]'. Thus V comprises two independent sets 
[777] and [77]'. Bipartite graphs Gi = {[m] U [77]', Ei) and G 2 = {[m] U [77]', E 2 ) are isomorphic if 
there are permutations a of [777] and r of [77]' such that (j, k') E Ei if and only if (aj, Tk ') E E 2 . 
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Let G = ([m] U [n]\E) be a bipartite graph. We consider [m] and [n]' to have the usual 
linear ordering, and we will abuse notation by denoting these ordered sets simply by [m] and 
[n]'. Then let A{G) denote the m x n biadjacency matrix of G, with rows indexed by [m] 
and columns by [n]', such that A{iG') = 1 if (h/) ^ and A{iG') = 0 otherwise. We will 
use the graph and matrix terminology interchangeably. For example, we refer to rows and 
columns of G, or edges in 74(G). 

The neighbourhood in G of a vertex v G [m] U [n]' will be denoted by M{v). To avoid 
trivialities, we will assume that G has no isolated vertices, unless explicitly stated otherwise. 

A matching in a bipartite graph G = ([m] U [n]', E) is a set of independent edges, that is, no 
two edges in the set share an endpoint. A perfect matching is a set of edges such that every 
vertex of G lies in exactly one edge. For a bipartite graph G = ([m] U [n]',E) this requires 
m = n, and n independent edges in E. In particular, G can have no isolated vertices. We 
will call a bipartite graph with m = n balanced. 

Equivalently, a perfect matching may be viewed as n independent I’s in the nxn 0-1 matrix 
A{G). Thus a perfect matching M has edge set {{i,7i'i) ■ i G [n]}, where vr is a permutation 
of [n]. Equivalently, M has edge set {(cTj, j') : j G [n]}, where a is a permutation of [n]. Note 
that a = 7T~^ as elements of the symmetric group Sn- We may identify the matching M with 
the permutations tt and a. An example is shown in Fig. [1] below. 



Fig. 1: Bipartite graph with perfect matching a = (3241), tt = (4213). 


The total number of perfect matchings in a bipartite graph G is called the permanent, which 
we denote by per(A) when A = A{G). 


1.2 Computing the permanent 

The permanent has been studied extensively in Combinatorics and Computer Science. Valiant 
showed that computing the permanent exactly is 7 )^P-complete for a general 0-1 matrix |36j . 
No algorithm running in sub-exponential time is known for the exact evaluation of the per¬ 
manent of 0-1 matrices. 

Jerrum, Sinclair and Vigoda [19] showed that the 0-1 permanent has a fully polynomial 
randomised approximation scheme (FPRAS), using an algorithm for randomly sampling 
perfect matchings. This improved a Markov chain algorithm of Jerrum and Sinclair [T8] . 
which was not guaranteed to have polynomial time convergence for all bipartite graphs. 
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The algorithm of [19] is simple, but involves polynomially many repetitions of a polynomial- 
length sequence of related Markov chains. The best bound known for the running time of 
this algorithm is O(n^log^n), due to Bezakova, Stefankovic, Vazirani and Vigoda [2|. 

Jerrum, Valiant and Vazirani [201 showed that sampling almost uniformly at random and 
approximate counting have equivalent computational complexity for many combinatorial 
problems, including the permanent. Technically, the problem must be self-reducible. 

From the viewpoint of theoretical Computer Science, these results entirely settle the question 
of sampling and counting perfect matchings in bipartite graphs. However, simpler methods 
have been proposed for special cases of this problem, and here we consider one such proposal. 


1.3 The switch chain 


Diaconis, Graham and Holmes [9] proposed the following Markov chain for sampling perfect 
matchings from a balanced bipartite graph G = ([n] U [n]',E) almost uniformly at random, 
which we will call the switch chain. A transition of the chain will be called a switch. Diaconis, 
Graham and Holmes [9] called this a “transposition”. The switch chain generalises the 
transposition chain for generating random permutations. 


( 1 ) 

( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 


Switch chain 

Let the perfect matching at time t be the permutation vr of [n]. 

Set t ■(— 0, and let Mq be any perfect matching of G. 

Ghoose i,j G [n], uniformly at random, so (qvr'), (j, vr') G Mf. 

li i ^ i and (i, vr'), (j, tt') are both in E, 

set Mt+i ^ Mt\ {(v, Ti'i), (j, vr')} U {(v, vr'), (j, vr')}. 

Otherwise, set Mi+i ^ Mf. 


Set Iff< tmax, repeat from step (2) 


Otherwise, stop. 


Note that the switch chain is invariant under isomorphisms of G, so properties of the chain 
can be investigated from the viewpoint of graph theory. An example of a switch is shown 
below, with the edges (4,1'), (2,2') being switched for (4,2'), (2,1'). 


1' 4 3' 



Aft+i : 


1' 4 3' 
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Graph classes 

2.1 Chordal bipartite graphs 

The first question we might ask about the switch chain is: for which class of graphs is 
it ergodic? We wish to have a graph-theoretic answer to this question, so that we can 
recognise membership of graphs in the class. Therefore, we restrict attention to hereditary 
graph classes, that is, those for which all (vertex) induced subgraphs of every graph in the 
class are also in the class. Hereditary classes are central in modern graph theory, and are 
most usually characterised by describing a minimal set of “excluded subgraphs”, induced 
subgraphs which cannot be present. For example, perfect graphs are those which exclude all 
odd-length cycles {odd holes) of length at least 5, or their complements {odd antiholes) [7]. 
Thus, in particular, the class of perfect graphs contains all bipartite graphs, which exclude 
all odd holes and antiholes. All the graphs we consider here are bipartite, and hence perfect. 

In our application, there is a further technical reason for preferring to work with heredi¬ 
tary graph classes. We then have self-reducibility for most problems in ^P, including the 
permanent. This property implies the equivalence between almost uniform sampling and 
approximate counting referred to in Section 11.21 See [20] for details. 

The switch chain is ergodic on a graph G = (V, E) if the state space of the chain, the 
set of perfect matchings, is connected by switches. We extend this to include graphs with 
no perfect matching, where the state space is empty. Then we will say that a graph G is 
hereditarily ergodic if, for every U O V, the induced subgraph G[U] is ergodic. A class of 
graphs will be called hereditarily ergodic if every graph in the class is hereditarily ergodic. 

Diaconis, Graham and Holmes [9] observed that the switch chain is not ergodic for all 
balanced bipartite graphs. They gave the example: 

1 ' 2 ' 3 ' 

1 r 0 1 1 ■ 

1 ' 2 10 1 

3 110 


2' 3 



3' 2 


This graph has two perfect matchings, but the switch chain cannot move between them. 
This is because the graph is a chordless 6-cycle. In fact, this property characterises the class 
of graphs for which the switch chain is not ergodic, as we now show. 

We say a graph G is chordal bipartite if it has no chordless cycle of length other than four. 
The class of chordal bipartite graphs is clearly hereditary. Note that the dehnition of chordal 
bipartite graphs is an excluded subgraph characterisation. To show that the switch chain is 
ergodic for this class, we require the following “excluded submatrix” characterisation. 
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A r (Gamma) in a 0-1 matrix is an induced submatrix of the form 


A matrix is called F-free if it has no such induced submatrix. Lubiw ESI gave the following 
char act erisat ion. 

Theorem 1 (Lubiw). A bipartite graph is chordal bipartite if and only if it is isomorphic to 
a graph G such that A{G) is V-free. □ 

Moreover, Lubiw |25] showed that this characterisation can be used to recognise chordal 
bipartite graphs in 0{\E\ log \ E\) time. This was subsequently improved to 0{\E\) time by 
Uehara [32] • For the switch chain we have: 

Lemma 2. The largest hereditary class of bipartite graphs for which the switch chain is 
ergodic is the class of balanced chordal bipartite graphs. In this class, if G = {[n]U [n]',E), 
the diameter of the chain is at most n. 


Proof. Clearly any graph with an induced cycle of length greater than 4 cannot be in the 
class, so we need only show ergodicity for chordal bipartite graphs. The chain is aperiodic. 


since there is a loop probability at least 1 /n at each step, from choosing i = j in step ( 2 ) 
Thus we must show that the chain is irreducible. From Theorem [H we may suppose that 
A{G) is given with a F-free presentation. 


Let Q = (G, £) be the transition graph of the switch chain, with G the set of perfect matchings 
in G, and £ the set of transitions. We must show that Q is connected, and has diameter at 
most n. Let tt and a be any two perfect matchings in G, and let dist( 7 r, a) = |{i : tt' 7 ^ a[}\. 
Note that dist( 7 r, a) < n, and dist( 7 r, cr) = 0 implies tt = cr. 


Let k be the smallest index such that 7 r(, 7 ^ (j(, and, without loss of generality, suppose 
Ti'k > cr'k. Then there exists i > k such that ■n[ = cr(,, and hence ti[ 7 ^ a[. Then we have 
{k, a'k), {k, O, {i, a'k) eE,i>k and 


k 

£ 


1 

1 


1 

? 


The F-free property of A{G) then implies (!', vr(,) G E. Thus we have (fc, 7 r(,), (^, vr^) G vr and 
(fc, Ti'f), {i, Ti'k) G E. Therefore r G G and (vr, r) G £, where 

T = 7r\{{k,7r'k), {£, Ti'f)} U {{k, vr^), {£, tt^}. 

Note that = vr' for i ^ k, i. However, 7 r(, 7 ^ but t(, = = cr(,. Also 7 ^ but 

= n'k = (y\ if Ti'k = Thus dist( 7 r, a) — 2 < dist(r, a) < dist( 7 r, a) — 1. Hence there is a 
path of at most n edges in Q between any pair of matchings tt, a. Therefore the diameter of 
Q is at most n. □ 
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Computing the permanent exactly is known to be ^^P-complete for the class of chordal 
bipartite graphs [2Z], though this result does not extend even to chordal bipartite graphs of 
bounded degree. The complexity of exact computation of the permanent is unknown for all 
the subclasses of chordal bipartite graphs considered below. That is, with the exception of 
bounded-degree convex graphs, which we consider in Section 12.21 below, and chain graphs, 
which we examine in Section 12.51 

2.2 Convex graphs 

The largest class of graphs considered by Diaconis, Graham and Holmes [S] were those with 
“two-sided restrictions”. These are bipartite graphs G for which A{G) has the consecutive 1 ’s 
property. These have been called convex graphs in the graph theory literature. They were 
introduced by Glover [13], who gave a simple greedy algorithm for hnding a maximum 
matching in such a graph. The consecutive I’s property can be recognised in 0{\E\) time 
by the well-known algorithm of Booth and Lueker [5] . 

A bipartite graph is convex if it is isomorphic to a graph G = ([m] U [n]', E) such that Af{i) 
is an interval [a'i, PI] C [n]' for all i G [m]. Note that this property remains true under an 
arbitrary permutation of [m]. Then 

Lemma 3. Convex graphs are a proper hereditary subclass of chordal bipartite graphs. 

Proof. It is easy to see that the class Convex is hereditary. To see that it is a subclass of 
chordal bipartite graphs, we permute rows so that P[ < /5' when i < j. Now we can show 
that A{G) is F-free. If not, there is a F in some rows i,j and columns k',i'. 

k' t' 

i r 1 1 ■ 

i [ 1 0 _ 

We have i < j but, since the rows of A{G) have consecutive I’s, /?' > P > /?'. This 
contradicts our ordering of the rows of A{G). To see that it is a proper subclass, note 
that there are at most n,!^) = A labelled convex graphs with n rows and columns, 

whereas Spinrad [30] has shown that there are chordal bipartite graphs. (Spinrad 

also gives in [301 Ex. 9.16(c)] an explicit example of a graph that is chordal bipartite but not 
convex.) □ 

It is possible to give excluded subgraph and excluded submatrix characterisations of convex 
graphs, but we will not explore this here, since they are not easy to describe, and appear to 
have little algorithmic application. See [33] for details. 

Blumberg [1] gave an bound on the mixing time of the switch chain for convex graphs 

with r = maxje[n] deg(i). This is a hereditary subclass of convex graphs, since it is easy to see 
that graphs with bounded row- or column-degree form a hereditary subclass of any hereditary 
class. We will give exact algorithms for counting and sampling in this subclass of convex 
graphs. First we will show that these graphs also have bounded column degree. 
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Lemma 4. Let G = ([n] U [n]',E) be a convex graph containing a perfect matching. Let 
r = maxjg[„] deg(i) and c = maxjg[„] deg(j'). Then we have c < 2r — 1. 

Proof. Let M be any perfect matching of G. We first permute the rows of A{G) so that M 
is the diagonal of A, i.e. M {{fG') • * ^ Nl- To bound c, consider any edge (i, j') G E. 
Since G is convex, and {iG') G E, we have i',j' G [a[^ (3'f\ and so \i — i\ < r — 1. Hence 
^ G [j — r + 1, j + r — 1] n [n], and so Af{f) C [j — r + 1, j + r — 1] fl [n]. Therefore we have 
c < 2r — 1. □ 

It is known that there is an exact algorithm for computing the permanent which is linear in 
n for all graphs of bounded treewidth [51 Thm. 1], Convex graphs with r = maxjg[„] deg(z) 
have treewidth at most 2r — 1. Unfortunately, the general algorithm of Courcelle, Makowsky 
and Rotics [8] is superexponential in the treewidth. An algorithm of Fiirer [121 Thm. 3], for 
counting independent sets in graphs of bounded treewidth, could also be applied, since the 
treewidth of the line graph of a convex graph can be bounded by 8r^. (We will not prove these 
facts about treewidth here, since we do not use them.) Combined with Fiirer’s algorithm, 
this gives an algorithm for the permanent which is linear in n, but exponential in r^. 

However, we will not use either of these approaches, since the following dynamic program¬ 
ming algorithm has better time complexity for the problem at hand. 

Lemma 5. Let G = ([n] U [n]',E) be a convex graph containing a perfect matching, and 
let r = maxjg[„] deg(z). Then, for any subgraph G* of G, the permanent of A{G*) can be 
evaluated exactly in time O(r^^n). Hence the permanent can be evaluated in polynomial time 
for all convex graphs with degree bound O (log n/log log n). 

Proof. Let A = A{G*). The algorithm uses triangular windows Wi of width 2r -|- 1 and 
height 2r -|- 1, with corners at A{i, {i — r)'), A{i, [i -f r)') and A{i + 2r, (z -1- r)'). Note, from 
Lemma IH that W) cuts G as shown below. Moreover, for every edge of G there is an index 
i such that the corresponding entry of A appears in the window W^. 


(i-r)' i' {i + r)' 



At iteration i of the algorithm, a subperfect matching Q will be a matching of G*, such that 
(a) Every row j < i has a matching edge; 










column (i + r + 1)' 



(b) Every column j' < min{(i + r)', n'} has a matching edge; 

(c) No row j > i + 2r has a matching edge; 

(d) No column j' > {i + r)' has a matching edge. 

Note that a subperfect matching cannot always be extended to a perfect matching of G*. 
We consider the set 

Si = {M : M = Q nWi and Q is a subperfect matching } . 

Note that |5'i| < (2r)!, since each column of Wi is either empty or contains a unique edge in 
any of positions 1,2,..., j, for j = 1, 2,..., 2r — 1. For M G Si, let 

Ni{M) = \{Q ■.QnW, = M}\, 

be the number of subperfect matchings represented by M. Initially, i = 1 and Si will be the 
set of all matchings in Wi such that every vertex j' < (r + 1)' has a matching edge. When 
i = n — r, all the subperfect matchings represented in Wn-r will be perfect matchings, and 
so we will have 

per(yl) = 

We must show how to update the M and Ni{M) from Wi to Wj+i. Let W* = W fl W+i. 
First we remove row i. We remove all M & Si such that row i contains no matching edge, 
since they cannot correspond to a subperfect matching at iteration (f + 1). Then we delete 
the matching edge in row i from M, for all M E Si. This will produce a set S* of matchings 
in W*, 

S* = {M : M = Q n W* and Q is a subperfect matching } . 

We must now add column (f + r + 1)' to W+i. For all M* G S*, we attempt to augment 
each M* with a matching edge e in column [i + r + l)^ Note that e must be in W+i, and e 
can be in any row which has no matching edge in M*. If no such row exists, we delete M* 
from S*, since it cannot correspond to a subperfect matching at iteration f + 1. Otherwise, 
for each possible choice of e, we add M = M* U {e} to S'j+i, and set 

W+i(M) = {Ni{M*) : M* G Si, M* O Wi+i = M nWi} 

This completes the description of the algorithm. 
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The operations in the update require 0{r\Si\) time, except for the removal of duplicates, 
which can be implemented in OdS**! log |S'j|) = 0{r'^\Si\) time. Therefore, since 

< r^( 2 r)! ~ 2 \/^r^/^( 2 r/e)^'’ = 0 (r^’’), 

and 0{n) updates must be performed, the overall time complexity of the algorithm is 0(r^^n). 
This is polynomial in n if r = 0(\ogn/ log log n). □ 

We can extend the algorithm of Lemma [5] to sample a matching uniformly at random. To 
do this, we must retain the sets Si and the counts Ni{M) (M e Si) used in the permanent 
evaluation. Then the sampling algorithm is a standard dynamic programming traceback 
through Sn-r, • • ■, Si, ..., Si, using the Ni[M) to select matchings with the correct proba¬ 
bility. See [To] for a more complete description of similar uses of traceback sampling. The 
time complexity for sampling a single matching is (9( 'Yhi l*S'i|) = 0 (( 2 r)!n). 

Thus dynamic programming is superior to Markov chain sampling for convex graphs with 
small degree bound, at least if the chain must be run for its a guaranteed mixing time. 

2.3 Biconvex graphs 

Diaconis, Graham and Holmes [S] considered the following subclass of convex graphs. A 
graph G = ([m] U [n]',E) is biconvex if it is convex and Af{j') is an interval [aji,(]ji] C [n] 
for all j' G [n]'. Thus A{G) has the consecutive I’s property for both rows and columns. 

Lemma 6 . Biconvex graphs are a proper hereditary subclass of convex graphs. 

Proof. It is easy to see that the class Biconvex is a hereditary subclass of Convex. To 
see that it is a proper subclass, consider the example: 


1 

2 

3 

4 


1 ' 2 ' 3 ' 4 ' 

10 0 0 
1110 
0 10 0 

0 0 11 


In a biconvex ordering, row 2 must be adjacent to row 1 and row 3. Otherwise, columns 1' 
and 2' cannot be convex. But row 4 must also be adjacent to row 2, or column 3' cannot be 
convex. These conditions clearly cannot be satished simultaneously. □ 


As with convex graphs, it is possible to give excluded subgraph and excluded submatrix 
characterisations of biconvex graphs. Since these are a little easier to describe than for convex 
graphs, we will give the excluded subgraph characterisation. Tucker shows [331 Thm. 10] 
that a bipartite graph is biconvex if and only if it does not contain the graphs I„ for n > 1 , 
III, II 2 , nil, 1112 and III 3 as induced subgraph. Here is a chordless cycle C' 2 n+ 4 , Hi is the 
triomino and HIi is the tripod. 
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Fig. 2: The triomino, the tripod and the graphs II 2 , III 2 and III 3 


We know that the switch chain converges eventnally on biconvex graphs, bnt how qnickly is 
this guaranteed to occur? Unfortunately, the convergence may be exponentially slow. Both 
Matthews [26] and Blumberg |1] gave the following examples Sfc = (M U [n]', £*,), where 
n = 2k — 1: 

{ 1 < i < k and k' < j' < {k + i)'] 
i = k and 1 ' < j' < n'; 
k < i < n and {i — k)' < j' < k'. 


For example, 94 is 


A{9,) 


1' 2' 3' 4' 5' 6 ' 7 ' 


0 0 
0 0 
0 0 
1 1 
1 1 
0 1 
0 0 


0 110 0 
0 1110 
0 1111 
11111 
110 0 0 
110 0 0 
110 0 0 


Let 71 be any perfect matching. Then choosing vr^ < k ' forces tt' = {k + i)' for i G [k — 1 ], and 
similarly choosing vr^ > k ' forces 7r^_,_j = i' for i G [/c — 1]. Thus the set of perfect matchings 
of 9fc is Si U 5 *2, where Fi = {tt : < k '} and 5*2 = {vr : vr^ > k '}. 

Clearly S'inS '2 = {^ 7 : 77 ^ = k'} = {a}, for a single matching a. Moreover, it is not difficult to 
show that there are 2^“^ ways to extend a partial matching vr with tt' = {k + i)' for i G [fc — 1] 
to a perfect matching. One way is to note that the submatrix induced by rows [k, n] and 
columns [k]' is a chain graph, for which the permanent is easy to compute. (See Section [23] 
below and the formula presented there.) Thus [S'! O S' 2 | = 1 and |S'i| = |S' 2 | = 2 ^“^, and 
hence [S'! U ^ 2 ! =2^ — 1. 

Therefore, if the switch chain is started at a random matching in Si, it will need 0(2"') time 
before it reaches a, and it cannot enter S 2 before this occurs. This gives an 0(2") lower 
bound on the mixing time of the chain. This argument can be made completely rigorous, 
see [I] or [26], but we will not do so here. 


2.4 Monotone graphs 

Diaconis, Graham and Holmes [9] considered a structured subclass of biconvex graphs, which 
they called monotone, and showed that the switch chain is ergodic on monotone graphs. 
However, note that Lemma [2| gives a stronger result for the larger class of chordal bipartite 
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graphs. Diaconis, Graham and Holmes [9] conjectured further that the switch chain mixes 
rapidly in the class Monotone. 

A bipartite graph G = {[m] U [n]', E) will be called monotone if it is isomorphic to a convex 
graph such that a' < a' and /S' < /S' for alH <G [m] with i < j. Thus A{G) has a “staircase” 
structure. An example is shown in Fig. [3l 

3 ' 4 ' 5 ' 

0 0 0 ‘ 

1 0 0 
1 1 0 
1 1 1 
0 1 1 



Fig. 3: A monotone graph 


We will always assume that G has the labelling of the dehnition, unless otherwise stated. 
First we show that, if G is row-monotone, it is also column-monotone. 

Lemma 7. A monotone graph is biconvex, and ai' < ay, /Sj/ < (3y ifi', j' G [n]' and i' < j'. 

Proof. For j G [n], let s = min{z G Af{j')} and t = max{z G Af{j')}. If s < i < t, then 
j' > Oi[ > a[ and j' < (3'^ < /S', so j' G [a',/?-] = Af{i) and hence i G Af{j'). Thus 
is the interval [s,f\, so we may take aj> = s, (3y = t. Hence a^i = min{fc : i' G [«fc,/Sfc]} 
and ay = minj/c : j' G [a^j/S^]}, so i' < j' implies «*/ < aj>. Similarly i' < j' implies 

A' ^ A'- ^ 

Next we show a “forbidden submatrix” characterisation of monotone graphs, extending that 
of Lubiw [25] for chordal bipartite graphs. 

Lemma 8. A bipartite graph is monotone if and only if it is isomorphic to a graph G such 
that A{G) has none of the following as an induced 2x2 submatrix: 


r (Gamma) : 


1 1 
1 0 


J (backwards L) : 


0 1 
1 1 


/ (slash) : 


Proof. Suppose G is monotone, but A contains F or / in rows i and j, with i < j. Then 
row-convexity implies A > A'j ^ contradiction. Similarly, if A{G) contains a J, then row- 
convexity implies a' > a', again a contradiction. Thus, if G is a monotone graph, A{G) 
cannot contain r, J or /. 

Now assume A{G) contains no F, J or /. Suppose A/'(i) is not an interval, so there exist 
j' < k' < V so that {i,j'), {i,l') G E, but {i,k') ^ E. Since J\f{k') A 0, there exists s G [n] 
such that (s, k') ^ E. If s < i, then A{G) contains the hrst conhguration below, which is 
either a F or a /, a contradiction. If s > i, then A{G) contains the second conhguration 
below, which is either a J or a /, also a contradiction. 


12 













f 

fc' 


k' i' 

s 

? 


i 

1 

O 

_1 

i 

1 

O 

s 

1 ? 


Therefore suppose that i < j, but a' > a'. Then A{G) contains the hrst conhguration 
below, which is a J or /, a contradiction. Similarly, if /S' > /S', A{G) contains the second 
conhguration below, which is a F or /, again a contradiction. Hence G is monotone. 
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A bipartite permutation graph is a permutation graph which is also bipartite. A graph 
G = (H, E) is a permutation graph if there are permutations tt, a of H so that (tt*, tij) E E if 
and only if vTj < tt^ and at > aj. This can be given an intersection presentation, where vr, a 
are on parallel lines, and connected by lines (n,n), for all v E V. Then {v,w) E E if and 
only if corresponding lines {v,v) and {w,w) cross. 

Spinrad, Brandstadt and Stewart [3l] studied this class of graphs, and gave 0{\E\) time 
algorithms for recognising membership in the class, and for constructing a crossing repre¬ 
sentation. An example is shown in Fig. 0] below. 


2' 3 4' 



1' 2' 3' 4' 


3(G): ^ 

4 


110 0 
1110 
1111 
0 0 11 



Fig. 4: A bipartite permutation graph with its intersection representation 


Our reason for introducing this class of graphs is that the bipartite permutation graphs are 
precisely the monotone graphs. 

Lemma 9. A graph is monotone if and only if it is a bipartite permutation graph. 

Proof. The condition of Lemma El is equivalent to the following. If {i,k'),{j,£') G E are 
such that i < j and k' > f", then {j,k') E E. The conclusion now follows from the 
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characterisation of bipartite permutation graphs given in [3T], in particular Dehnition 3 and 
Theorem 1. □ 


Note that Lemma [H] is not a “forbidden subgraph” characterisation in the usual graph- 
theoretic sense. However, such a characterisation is known. 


Lemma 10. [2^ Lem. 1.46] A graph is monotone if and only if it is chordal bipartite (i.e. it 
has no chordless cycle of length other than 4), o,nd it contains none of the three graphs shown 
in Fig. 0 as an induced subgraph. □ 




Fig. 5: The tripod, the armchair and the stirrer. 


For example, the graph G given in Fig. [6] contains the armchair as a subgraph. 


A{G) = 3 


1 ' 

0 

0 

0 

1 

0 


2 ' 

0 

0 

1 

1 

1 


3' 

0 

1 

1 

1 

1 


4' 

1 

1 

1 

0 

0 


5' 

0 1 
1 
0 
0 
0 


induced 
subgraph; 



3' 

4' 

5' 

4? 

1 

r 0 

1 

0 1 


2 

1 

1 

1 

3'(h 

3 

1 

1 

0 

0 LJ. 

4 

L1 

0 

0 J 

5'" 


Fig. 6: A biconvex graph containing the armchair 
Lemma 11. Monotone graphs are a proper hereditary subclass of biconvex graphs. 


Proof. The hereditary property follows easily from the dehnitions. The inclusion follows 
from Lemma [71 and strict inclusion follows from the example of Fig. El □ 

To apply the switch chain to a monotone graph, we need to know whether it contains any 
perfect matching. If it does, we need to identify one efficiently, in order to start the chain. 
However, these are easy questions. 

Lemma 12. A monotone graph G = ([n] U [n]4E) contains a perfect matching if and only 
if it contains the diagonal matching 6 = {{i,i') ■ i E [n]}. 

Proof. We prove this by induction on n. If n = 1, = {(1,1')}, and there is nothing to 

prove. So, suppose n > 1. Clearly (1,1') G E, or else either 1 or 1' is an isolated vertex, and 
hence G has no perfect matching. We will show that there is a perfect matching M* which 
contains (1,1'). Therefore, suppose that M is any perfect matching in G, with (1,1') ^ M. 
Then (1, j'); (l ^0 ^ ^ some i > 2, f > 2', and we have (1,1') G E. Hence {i,f) ^ E, 
or else A{G) would contain a F. 
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1 


1 ' / 
1 1 
1 ? 


Thus M* = M \ {(1, j'), (h 1')} U {(1,1'), is a perfect matching containing the edge 

(1,1'). Now we use induction on the graph G* given by deleting 1 and 1' from G, which 
contains the perfect matching M* \ {(1,1')}. □ 

We will be particularly interested in Hamiltonian monotone graphs, and there is also an 
easy criterion for Hamiltonicity of a monotone graph. To state this, we consider the graph 
illustrated below, the ladder L^. 


1 2 

f ^ 


4 

f 

n 





F 

3 

' 

, n 


Lemma 13. is a Hamiltonian monotone graph. 

Proof. Clearly is bipartite, and A/'(l), Mifl), A/'(3), ..., Af{n — 1), J\f{n) are, respectively, 

{!', 2'}, {!', 2', 3'}, {2', 3', 4'}, ... , {(u - 2)', (n - 1)', n'}, {{n - 1)', n'}, 

so are non-empty intervals satisfying the required ordering conditions. Finally, has the 
Hamilton cycle 1' —?• 2 —)■ 3' —)• • • • —)■ n' —)■ n —• • ■ —3 —)■ 2' —)■ 1 —)■ 1'. □ 

Then we have the following. 

Lemma 14. A monotone graph G is Hamiltonian if and only if it eontains the ladder as a 
spanning subgraph. 

Proof. If G has a spanning ladder, the Hamilton cycle in the ladder is also a Hamilton cycle 
in G, and so G is Hamiltonian. 

If G = {[m] U [n]', E) is Hamiltonian, it has a perfect matching, so m = n and G contains the 
diagonal matching 6, from Lemma WI\ We will show by induction that G contains L„, and 
so has a spanning ladder. The base case is n = 2 . Then G must be a 4-cycle, so G = L 2 . 

If u > 2, consider any Hamilton cycle H in G. Vertices 1 and 1' lie on this cycle. There are 
two cases: 

(a) The cycle H contains the edge (1,1')- Let j' 7 ^ 1' be adjacent on if to 1, and ^ 7 ^ 1 
be adjacent on H to 1'. Since i,j > 2, biconvexity implies (1,2') G E and (2,1') G E. 
Thus the three edges (1,1'), ( 1 , 2 '), ( 2 , 1 ') of are in E. Also {i,j') ^ E, since G is 
r-free. Hence z ^ j' —)■•••—)■ i is a Hamilton cycle H* in the monotone graph G* 
obtained by deleting 1 and 1' from G. 
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1 ' J' 
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(b) The cycle H does not contain the edge (1,1') G E. Let j', I' be the vertices of H 
adjacent to 1, and i, k the vertices of H adjacent to 1', so that H contains paths i —>■ j' 
and k —)■ avoiding 1 and 1'. Now, since G is F-free, ^ 

Since (1,1') G E, and ^ E for j > 2, convexity implies that (1, 2') G E. Similarly, 
since (1,1'), (i, 1') G -E, with i > 2, convexity implies that (2,1') G E. Thus the three 
edges (1,1'), (1,2'), (2,1') of are in E. Also i ^ j' —)■ A; —i is a Hamilton 

cycle H* in the monotone graph G* obtained by deleting 1 and 1' from G. 


1 


i 

k 


1 ' j' I' 

111 ' 

111 
1 1 1 



In both cases, the edges (1,1'), (1,2'), (2,1') of are in E, and we have a Hamiltonian 
monotone graph G* with bipartition [2,n] U [ 2 ,? 7 ,]'. It now follows by induction that G 
contains L„. □ 

2.5 Chain graphs 

Diaconis, Graham and Holmes called the simplest class of graphs they considered “one-sided 
restriction” graphs. These are usually called chain graphs in the graph theory literature |39j . 
and form a proper subclass of monotone graphs. 

A chain graph is isomorphic to a graph G = {[m] U[n]',E) where A/'(i) = [a*]' for all i G [m], 
with Oi < 02 < ■ ■ • < ttm- Hence chain graphs are a subclass of monotone graphs, given by 
taking a' = 1', = o', for all i G [n]. An example is shown in Fig. [3 

The following easy fact is then true. 

Lemma 15. A/'(j') = [6j, m] for all j' G [n]' with bi > b2 > ■ ■ ■ > bn- 

Proof. Since Oj < Oj+i, {i,j') ^ E implies {i + 1, j') G E. Let bj = min{i : Oj > j}. □ 
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4 ' 
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2 ' 



A{G)-. 


1 1 
2 1 1 

3 1 1 

4 1 1 

5 1 1 


0 0 0 
1 0 0 
1 1 0 
1 1 1 
1 1 1 


Fig. 7: A chain graph 


We note here that chain graphs have a very simple exclnded snbgraph characterisation. A 
bipartite graph is a chain graph if and only if it does not contain 2 K 2 , the graph with fonr 
vertices and two disjoint edges, as an indnced snbgraph. 

Lemma 16. Chain graphs are a proper hereditary subclass of monotone graphs. 


Proof. The hereditary property and the inclusion are clear from the definition. The inclusion 
is strict because a chain graph is a monotone graph with a' = 1' for all i G [n]. Hence the 
biadjacency matrix contains a column of I’s. But there exist monotone graphs which do not 
have a column of I’s. The graph in Fig. [3] is an example. □ 


Diaconis, Graham and Holmes |9] observed that there is a “classical” explicit formula for 
the permanent of a chain graph G. Of course, we must have m = n. Then, if A = A{G), 


per(A) 


0, if Oi < i for any i G [n]; 

nr=i(®* otherwise. 


For example, if 

1' 2' 3' 4' 5' 

1 r 1 1 0 0 0 

2 11100 

A : 3 11110 

4 11111 

5 11111 


then per(A) = 2(3 - 1)(4 - 2)(5 - 3)(5 - 4) = 16. 


After noting that the first oi columns of A are all I’s, and hence identical, this formula can 
be proved by an easy induction on the row order. The proof method can also be used to 
sample a perfect matching uniformly at random in 0(n) time. 

Matthews [26] showed, using a coupling argument, that the mixing time of the switch chain 
for chain graphs is bounded by O(n^logn). Blumberg [I] gave a detailed study of the 
eigenvalues of the transition matrix of the switch chain for this class, based on earlier work 
of Hanlon [T6| . 

These results clearly have little computational application, but establishing the mixing time 
of the switch chain for graphs in the the class Chain is far from trivial. For example, there 
are chain graphs for which the original Jerrum and Sinclair Markov chain [TH| has exponential 
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mixing time. Consider the graph G for which A{G) is lower triangular, so A{i,j) = 1 if * < J, 
A{i,j) = 0 otherwise. Then per(y4) = 1, from the formula above, but the graph G* given 
by deleting vertices 1 and n' has per(74*) = 2*^“^ by the same formula, where A* = A{G*). 
Thus G has one perfect matching, but an exponential number of near-perfect matchings. 
Therefore the algorithm of |T8] will need exponential time to sample a perfect matching 
almost uniformly. 


3 Analysis of the switch chain 

In Section [2] we have seen that the hereditary graph classes considered by Diaconis, Graham 
and Holmes [S] form an ascending sequence: 

Chain c Monotone c Biconvex c Convex c Chordal bipartite. 

We know from Lemma [2] that the switch chain is ergodic for (balanced) bipartite graphs in 
all these classes, and has diameter at most n. Here we consider the mixing time of the switch 
chain for these classes, that is, the time required to reach near-stationarity. We will dehne 
this more formally below. 

Here we take the Computer Science viewpoint of trying to distinguish only between polyno¬ 
mial time (rapid) mixing, and superpolynomial time (slow) mixing. We have observed that 
the switch chain may have exponential mixing time in the class Biconvex, and the switch 
chain has mixing time 0{n^ logn) in the class Chain |26]. Therefore we need only determine 
whether or not the class Monotone has polynomial mixing time. The remainder of this 
section will be devoted to showing that this class does indeed have polynomial mixing time. 

It is usual to measure deviation from the stationary distribution in terms of variation dis¬ 
tance. For a ergodic Markov chain on state space H, transition probabilities P : —)■ [0,1] 
and stationary distribution tt, the distance to stationary at time t, starting in state x G H, is 
Ax{t) = max^co |P*(ic, A) — 7r(H)|. Then the mixing time, Tmix(£), is the hrst time at which 
the Markov chain is £-close to stationarity, maximised over the choice of starting state x: 

Pnix(^) = maxmin{t : A^it) < e}. 

Two remarks about this dehnition. First, the function A^it) is monotonically non-increasing 
in t, so once the t-step distribution of the Markov chain is £-close to stationarity, it remains 
that way. Second, the dependence on of T^ix on e is weak, typically a multiplicative factor 
loge:“^, so it is usual to quote the mixing time as a function of the size of the problem 
instance, in our case n, and ignore the dependence on e (or set £ to some conventional value, 
usually £ = 1/e). For more on mixing time, and the general context for this section of the 
paper, refer to PQ Chaps. 3 & 4] and [171 Chaps. 3 & 5]. 

3.1 Canonical paths and flows 

Although there are other approaches to bounding the mixing time of Markov chains, here 
we will attempt only to apply the canonical paths approach of Jerrum and Sinclair [TB] . For 
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any symmetric Markov chain, this may be described briefly as follows. 


Snppose the problem size is n. The method reqnires constrncting paths of transitions of the 
chain df = Zi —)■ Z 2 = F, between every pair of states X and Y in the state 

space ff, snch that the length £ of each path is at most polynomial in n. This is often easy to 
achieve, bnt to obtain a good npper bonnd on mixing time it is essential that the paths are 
“spread ont” over the state space, and do not overload any particnlar transition. The degree 
of snccess in achieving this end is measured by the congestion of the set of paths. Denote 
the (canonical) path from X to T by 7 xv- Then the congestion q of the chosen paths is 
given by 


Q = max 

(Z,Zt) 


7r(Z)P(Z, Zt) 


7r(X)7r(y) |7xv||. 


X,Y-.-ixY^{.Z,Z^) 


( 1 ) 


where | 7 xy| denotes the length of the path 7 xy, and the maximisation is over all transitions 
Z —)■ Z\ i.e., all pairs with V{Z, Z^) > 0 and Z^ 7 ^ Z, and the sum is over all paths that 
use the transition Z —)■ Zh 


Polynomial mixing time will follow from a polynomial upper bound on congestion. For the 
switch Markov chain, where tt is uniform over D, it can be seen from the definition of q that 
this is equivalent to ensuring that the number of canonical paths through any transition is 
bounded by |D|, possibly multiplied by a polynomial factor. A possible strategy for obtaining 
a good bound on congestion is therefore the following. Fix a transition Z —)■ Zh For every 
canonical path 7 xy from X to Z that uses transition (Z, Zl), specify an encoding PF G D, 
such that, given W and g additional bits of information, we can identify X and Y uniquely. 
It is common to refer to the additional information as “guesses”. In this setting, the mixing 
time Tmix of the chain can be bounded by 2'^^®^poIy(n). Ideally, we seek g = O(logn), since 
this will give a polynomial bound on the mixing time. 

The strategy just described is the one we will apply in this section to the switch chain on 
monotone graphs. In the proof of Lemma |2] we implicitly constructed canonical paths for the 
more general situation of chordal bipartite graphs. The counterexample Qk from Section 12.31 
demonstrates that the congestion of those canonical paths is in fact exponentially large, 
necessarily so even for biconvex graphs. Achieving low congestion in the monotone setting 
is a delicate matter, and will be addressed in the following section. 

It only remains to present a precise relationship between congestion and mixing time, as 
given in m Cor. 5.9], based on Sinclair [29] . 


Lemma 17. The mixing time of a symmetric Markov chain with uniform stationary distri¬ 
bution is bounded by 

21 nix(£) < 2 p(ln |D| + 21 ne~^), 

where g is the congestion with respect to any set of canonical paths. 


Technically, this result applies to a so-called “lazy” version of the Markov chain, in which 
which a fair coin is flipped at each step. A transition is made if the coin comes up “heads”, 
and no transition is made if the coin comes up “tails”. In our case, the loop probabilities, 
which are at least 1/n, are sufficient to avoid the introduction of the lazy version. Then the 
factor 2 could be saved in the mixing time bound, but we will not use this refinement. 
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3.2 Construction of canonical paths 

Our goal is to construct canonical paths between arbitrary pairs X, Y of perfect matchings 
in G. In general, {V, X U Y) is a subgraph of G that is composed of alternating cycles 
Cl U • • • U Cs = X © y, and isolated edges X HY. Assume that these cycles are ordered 
deterministically in some way. (For example, order the cycles according to the smallest 
unprimed vertex in each cycle.) Then we may switch each of the cycles in order, using the 
procedure we will describe below. The isolated edges can be left untouched. Thus, it is 
sufficient to construct the canonical path for a single alternating cycle. 

In fact, we may specialise the canonical path construction even further. Since Monotone 
is a hereditary class, if H is any alternating cycle in G, it is a Hamilton cycle in a smaller 
monotone graph G\y{Hy\. Thus we will assume that G\y{Hy\ = G in the remainder of this 
section, and let H be the (Hamilton) cycle with vertices (ui, U 2 , ^ 2 ,..., Un, t)^). So our 
initial and hnal matchings are as follows: 

X = {(Mi,u(),(u2,U2),...,(Un,<)} 

and 

Y = {(u2,t(),(m3,T2),...,(mi, 0}, 
where we will choose ui = n as the initial vertex of the cycle. 

With each pair (m,u') G [u] x [n]' (which may or may not correspond to an edge of G) we 
associate a point p = [v^n — u + 1) in In particular, we associate the edges in X with 
the points {p* = (u*, n — m* + 1) : i G [u]} and those in Y with points {g* = (u*, n — u i+i +1 ) : 
z G [n]} (interpreting as Ui). We can think of this mapping as assigning Cartesian 

coordinates to the entries of the adjacency matrix A{G) in such a way that the pattern 
of entries plotted in looks exactly like the pattern of entries in the matrix A{G) as 
conventionally written: the x coordinate increases with increasing column number, and the 
^/-coordinate decreases with increasing row number. Denote the x- and //-coordinates of point 
p G by x{p) and p(p), so that p = (x(p),p(p)). 

Let P = {pi,P 2 , ■ ■ ■ ,Pn} U {qi, q 2 ,..., qn} C The alternating (Hamilton) cycle X UY 
corresponds to the cyclic sequence {pi,qi,P 2 ,Q 2 , ■ ■ ■ ,Pn,Qn) of points. Join the adjacent 
points in this sequence by line segments (omitting the return segment from g„ to pi) to yield 
a continuous path H from pi to g„. This path consists of alternating horizontal and vertical 
segments. By the choice Ui = n for the initial vertex we have that p(pi) = y{qn) = 1, he., 
that the path begins and ends at the lowest point (corresponding to the hnal row of the 
matrix). Somewhere along the way the path reaches the highest point x{qk) = x(pfc+i) = n 
(corresponding to the hrst row of the matrix). 

The following lemma is inspired by the “mountain climbing problem” (see, for example |34jj. 
and is proved in the Appendix. 

Lemma 18. Suppose H is as above. There are continuous functions a,f3 : [0,1] —?■ H 
satisfying a( 0 ) = pi, a(l) = qu, /3(0) = /3(1) = pk+i, and y{a{t)) = p(/3(f)) for all 

t G [0,1]. Moreover the set of events 

T = {f G [0,1] : a{t) e P or (d{t) G P} 
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has cardinality at most . 

Note that the trajectories of a{t) and /5(t) will not, in general, move in a uniform sense 
along If: it may be necessary for either or both of a{t) and /d(f) to retreat along If for 
periods in order to make progress later. 

Before entering into the details of the construction of canonical paths, here is a high-level 
overview. We regard the general points [n]^, the special points P, and the path fl as forming 
a board, on which we play out a game involving n tokens. Movements of the tokens will 
mirror switches in the canonical path from X to Y. Place the tokens initially on the points 
Pi,P 2 , ■ ■ ■ iPn that represent the initial matching X. Our aim is to move these n tokens to 
the points qi,q 2 , ■ ■ ■ ,qn in a manner consistent with switches in the original graph. Thus, 
at each step, we will take two tokens and move them to new locations. Note that the two 
tokens we choose form the endpoints of the diagonal of some axis-aligned rectangle, say R. 
We remove these tokens and replace them on the endpoints of the opposite diagonal of R. 
For this operation to represent a valid switch in the original graph, it is necessary for the 
new locations for the tokens to correspond to I’s in the matrix A{G). Ensuring that this 
happens at every step requires the tokens to be moved in a very particular order. Describing 
this order is at the heart of the matter. 

We think of each of a(t) and (3(t) as being a focus [of activity]. As time passes from from 0 
to 1, and points a{t) and /3{t) trace out their trajectories along fl, we shall move tokens in 
the neighbourhood of the foci according to certain rules. (Refer to Figure |H] for a pictorial 
description of such a rule.) If we remove the points a{t) and f3{t) from the path 11, we 
separate 11 into three connected pieces. Denote the points in P lying in the middle piece 
by Pu = Pu{t) and the remaining points by Pl = Piit). Note that PLit)UPuit) = P, except 
at events t, when a{t) E P or f3{t) G P. We shall ensure that tokens lying in Pu are in their 
original locations (i.e., points of the form pf), while those in Pu are in their hnal locations 
(i.e., points of the form qi). As time passes, Puit) tends to increase in size and Pu{t) tends 
to decrease, corresponding to tokens passing from the initial points {pi,... ,pn} to the hnal 
points {qi,...,qn}. 

The arrangement of tokens on the board at time t (viewed as a subset of points in [n]^) will 
be called the configuration of tokens and denoted a = cr{t). As we want a to correspond to 
a perfect matching in G, we insist that it contain one point from every row y = i E [n] of 
the board and one point from every column x = j E [n]. In what follows, it may be worth 
bearing in mind the basic underlying strategy, which is to keep the tokens as far as possible 
on the points P. If this can done effectively, it will be possible to provide an encoding of 
the current state (in the sense described of the previous Section 13.ip by forming a perfect 
matching W guided by the points of P that are not in the current conhguration a. 

As the foci trace out their trajectories, there will be periods (open time intervals) when both 
a{t) and fi{t) are both on (open) vertical segments, and during these periods y{a{t)) and 
y{f3{t)) are either both monotonically increasing or both monotonically decreasing; call these 
v-periods. the v-periods are separated by h-periods, closed time intervals during which one 
of a{t) or /3{t) is stationary and the other is moving horizontally. During v-periods, a{t) is 
constant and well dehned, in a manner that will be explained presently. We do not examine 
the conhguration during h-periods, so its dehnition there is not important; one could decree. 
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e.g., that the conhguration there is the same as during the previous v-period. 

During a v-period, neither a{t) nor (3{t) is a member of P; as a consequence, Piit) UPuit) = 
P, so that every point in P is assigned either to Piit) or Puit). So assume that a{t) and 
f3{t) are both on (open) vertical segments. For convenience we introduce a local naming 
scheme around a(t) and /3(t). Denote by ai and 02 the lower and upper ends of the line 
segment containing a(t). Continue the numbering ..., oq, Oi, 02 , 03 ,... along the path If 
as far as necessary; this convention provides a local numbering of some subsequence of 
Pi,qi,... ,Pn,qn, or its reversal. In a similar way, introduce names for the points around 
/3{t), with bi and 62 being the lower and upper ends of the line segment containing (3{t). 

Suppose cr C [n]^ is a conhguration of tokens. A hole-pair is a pair H of adjacent points 
H = {pi, qi] or H = {qi,Pi+i} of P such that a H H = (/). As the foci move, we attempt to 
maintain the following Invariant I, which is a conjunction of 11-13: 

11 { 01 , 02 } is a hole-pair. 

12 If x(oi) < x{bi) then { 62 ,^ 3 } is a hole-pair; otherwise {bo,bi} is a hole-pair. 

13 The are no hole-pairs beyond these two. 

A number of consequences follow from 11-13: (i) a{t) is completely determined by a{t) and 
(ii) |anP| = n - 1 , (hi) a n P^ C (gi, ^ 2 , • • •, ^n}, and (iv) a n P[/ C {pi,p 2 , • • • ,Pn}- 
(Starting from a hole-pair, and working around If, successive points in P are forced to 
be alternately in and out of cr; this demonstrates (i), and the other conclusions follow as 
byproducts of this argument.) Invariant I may fail after a token-switch, but when this 
happens it will be immediately reinstated at the following switch. There will generally be 
a single token (and exceptionally three) lying outside P (i.e., |cr \ P| = 1 or exceptionally 
|cr \ P| = 3). The position of such a token(s) will be called a dislocation, and is denoted d 
(or d' or d") in Figures [BUTT] below. 

Initially a = {pi,... ,Pn} so Invariant I will certainly not be satished at the outset. A similar 
remark applies to the hnal conhguration cr = (gi,..., g„}. We will see how to hnesse this 
issue later. For the time being, we assume that we are under way, that Invariant I is satished, 
and that t is in a v-period, so that a(t) and (3(t) are on vertical line segments ( 01 , 02 ) and 
( 61 , 62 )- We assume hrst that a(t) (and hence (3(t)) is moving upwards, i.e., that the rate of 
change of y{a{t)) is positive. (It transpires that the situation when a{t) is moving downwards 
creates little extra work, as it can be handled by symmetry, specihcally by rotating the board 
through an angle tt. More detail will be provided immediately after we have dealt with the 
focus-moving-upwards situation.) Depending on the ordering of 1 /( 02 ) and 1 /( 62 ) one of two 
events occurs hrst: either y{a{t)) = 7 /( 02 ) (an cr-event) or y[(3{t)) = 7 /( 62 ) (a /9-event). We 
consider the situation just before this event and just after, and, in particular, what action 
needs to be taken to maintain the invariant. (By “just after” here, we mean “at a point in 
time just after the start of the next v-period”.) 

We proceed by case analysis. There are eight cases I-IV and I*-IV*. First we split on 
whether x(oi) < a;( 6 i) (Cases I and II) or a;(oi) > a;( 6 i) (Cases III and IV), i.e., on the 
horizontal relationship between a{t) and /9(f) before the event. Then we split on whether 
7/(02) < 2 /( 62 ) (Cases I and III) or 7/(02) > 2 /( 62 ) (Cases II and IV), i.e., on whether it is an 
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Case I applies when x{ai) < x{hi), 1 /( 02 ) < //(& 2 ) and x{az) < x{hi). 



Case I* applies when x{ai) < x( 6 i), 2 /( 02 ) < 2 /(& 2 ) and a;(o 3 ) > x( 6 i). 


Fig. 8 : Cases I and I* 

a-event or a /3-event that occurs. Finally we split on whether a{t) and /3(t) have the same 
horizontal relationship after the event (unstarred cases) or opposite (starred cases). Note 
that the eight Cases I-IV* are exhaustive. We consider the eight cases in turn below. Cases 
I and I* illustrate most of the issues, so we treat them in more detail; the remaining cases 
will leave more for the reader. Some shorthand terminology will be useful in describing these 
cases. Let p, g G be points. We say that p is to the left of q (and q is to the right of p) if 
y{p) = y{q) and x{p) < x{q). We say that p is above q (and q is below p) if x{p) = x{q) and 
y{p) > y{q). 

Case I. This case is characterised by x(oi) < x{bi), 2 /( 02 ) < 2 /(^ 2 ) and x(o 3 ) < x{bi). (Refer 
to Figure |H1 which illustrates the layout of the board in the vicinity of the foci.) In this 
hgure and the later ones the conventions are as follows. The dotted-and-dashed line is 
y = y{a{t)) (= y{/3{t))) and marks the current 2 /-coordinate of the foci. Points with tokens 
are grey/red and points without tokens are white. The “before” picture is to the left and the 
“after” to the right. The current location of the dislocation is marked d. Note that d must 
always be the intersection of the vertical line through oi and 02 , and either the horizontal 
line through bo and bi, or through 62 and 63, as appropriate; this is a consequence of the fact 
that the board contains a token on every horizontal and vertical line. 

The action to be taken in this case is to switch the tokens at 03 and d. Recall that “switch”, 
in the token interpretation, means drawing the axis-aligned rectangle with 03 and d at the 
endpoints of one diagonal, and moving the tokens to the endpoints of the other diagonal. In 
this case, the tokens end up at 02 and a new dislocation d'. The most demanding check to 
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Case II applies when x(ai) < x( 6 i), 1 /( 02 ) > //(& 2 ) and x{ai) < x^b^). Only perform this 
switch if 1 /( 64 ) > 2 /( 63 ); otherwise, just move focus [3 to jS'. 




Case II* applies when x{ai) < x(6i), 2 /( 02 ) > 2 /(^ 2 ) and a:(oi) > x^b^). Only perform this 
switch if 2/(64) < 2/(^3); otherwise, just move the foci to a' and j3'. 


Fig. 9: Cases II and II* 

make is that the dislocation d' is legally placed after the switch. Recall that, to be legal, d' 
must be a point corresponding to a 1 in the matrix A{G). This part of the argument will 
always appeal to monotonicity. In this instance we see that the new dislocation d' is above 
03 and to the left of 62 ; since 03 and 62 both correspond to Is in the matrix then so does d', 
by monotonicity. Once this point has been established, and noting that the new hole-pairs 
are { 03 , 04 } and { 62 , 63 }, it is easy to check that 11-13 have been preserved. 

A note of caution. When checking the invariant, it is important to bear in mind that, there 
are no relations between points beyond the ones explicitly stated, and the ones implied by 
the line segments being parallel to the axes. Thus in Figure IH 03 (respectively, 63 ) may be 
to the left or right of 02 (respectively 62 ), and 04 may be above or below 03 . (In case 04 is 
below 64 , the foci will end up below where they have been drawn, but on the same vertical 
lines.) It is crucial not to make use of any unintended relationships between points. It would 
have been impractical (and unnecessary) to display all the possible combinations. 

Case I*. This case is similar to the previous one, but with xi^a^) > x{bi) replacing x{a^) < 
xipi). The fact that a{t) starts to the left of (3{t) and ends up on the right creates an 
additional complication, whose resolution necessitates an extra switch. 

The hrst action to be taken is to switch the tokens at 03 and 61 creating two new dislocations 
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Case III applies when x{ai) > x{hi), 2 /( 02 ) < 2 /(^ 2 ) and x{a^) > x{hi). 

-® 


-T 




fi a. 

^ 'ai 

b) a 

Case III* applies when x(ai) > x(6i), 2/(02) < 2/(^2) and ^(03) < x{hi). 

Fig. 10 : Cases III and III* 


at d' and d”. This is legal because d' is below 03 and to the right of 61 , and because d” is 
to the left of 03 and above hi. (Note that we use here both of the valid modes of reasoning 
arising from monotonicity.) At this point we have three dislocations, which is a worst case. 
Note that we have temporarily violated 13, but we will put this right at the following step, 
when we switch d and d”. This switch requires no justihcation, as we are moving tokens to 
their official locations in P. As an aside, we remark that 04 may be below 03 , in which case 
a{t) and /3(/:) will end up lower than drawn in the hgure. In any case, it is routine to check 
that Invariant I is satisfied after the second switch. 

Case II. This case is characterised by x(ai) < x(6i), 2/(^2) > 2/(^2) and x(ai) < x^b^). (Refer 
to Figure O) A new feature here is that the switch is conditional, and is only made when 64 
is above 63 . In that case, we switch 64 and d; this is legal because the new dislocation d' is 
above Oi and to the left of 64 . 

If 64 is below 63 (indicated by the ghostly 64 in the hgure) then no switch is required to 
maintain the invariant; all that is needed is to move the focus /3 rightwards to fl'. 

Case II*. This case is similar to the previous, but with x(ai) > x{h^) replacing x(ai) < x{h^). 
Again the switch is conditional, this time only being made when 64 is below 63 . In that case 
we switch 64 and d; this is legal because the new dislocation d' is below 02 and to the right 
of 64 . Otherwise we just move the foci to a' and /?'. 
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Case IV applies when x{ai) > x{hi), 2/(02) > 2/(^2) and x{ai) > x{hz)- Only perform the 
second switch if yipi) < y{hp)] otherwise, stop after the hrst with the foci at a' and ( 3 '. 



Case IV* applies when x(ai) > x(6i), 2/(02) > 2/(^2) and x{ai) < x{hp). Only perform the 
second switch if yip^) > yipP)-^ otherwise, stop after the hrst with the foci at a' and / 3 '. 

Fig. 11 : Cases IV and IV* 

Case III. This case is characterised by x{ai) > x{pi), 2/(02) < 2/(^2) and x{apj > x{pi). (Refer 
to Figure [Toll Here we switch 03 and d; this is legal because the new dislocation d' is below 
03 and to the right of hi. 

Case III*. This case is similar to the previous, but with x^ap < x{bi) replacing x^ap < 
x{bi). First we switch 03 and 62, creating new dislocations at d' and d"; this is legal because 
the positions of d' and d" relative to 03 and 62- We hnish by switching d and d". 

Case IV. This case is characterised by x{ai) > x{bi), 2/(02) > yipP) and x{ai) > x{pp. 
(Refer to Figure [HI) Here and in the hnal case, we have an obligatory switch followed by a 
conditional one. First we switch 62 and d, which is legal because the new dislocation d" is 
below 02 and to the right of 62- If is above 63 (indicated by the ghostly 64 in the middle 
frame of the sequence), we stop at this point, with the foci at a' and ( 3 '. Otherwise (64 is 
below 63), we switch 64 and d", which is legal as the new dislocation d' is below 02 and to 
the right of 64. 

Case IV*. This case is similar to the previous, but with x{ai) < x{pp replacing x{ap > 
x{bp. First we switch 62 and d, which is legal because the new dislocation d" is below 02 
and to the right of 62- If &4 is below 63, we stop at this point, with the foci at a' and f 3 '. 
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Otherwise (64 is above 63), we switch 64 and d", which is legal as the new dislocation d' is 
above ai and to the left of 64. 


(h,l) 



V'l 



(n + 1,1) 
(n + 1,0) 


Fig. 12 : Starting the canonical path with the aid of extra “virtual points” 

Cases 1 -IV* exhaust all the possibilities with a{t) is moving upwards. To deal with the 
situation when a{t) is moving downwards, we appeal to symmetry. The argument is as 
follows. Rotate the board through an angle of tt, an operation which is equivalent to the 
change of coordinates x := n — x + 1 and y := n — y + 1 . Note that, relative to the 
new coordinates, the conhguration is still legal, satisfying Invariant I. Note also that the 
underlying matrix is still monotonic, since monotonicity of a matrix is preserved by rotation 
through TT. However, a{t) is now moving upwards in the new coordinate system. Now apply 
the switch or switches dictated by the relevant one of the Cases I-IV* described above. 
Finally rotate the board back through an angle tt to obtain a new conhguration satisfying 
Invariant I. 

We now have to deal with the issue that the initial and hnal conhgurations do not satisfy the 
invariant. (There are no hole-pairs.) To get the procedure above going, we need to create 
one horizontal and one vertical hole-pair. We make space for this by extending the board 
and adding three “virtual points” Pn+i and qn+i- Suppose the coordinates of pi are (h, 1). 
Delete the point pi and add points = (h, 0 ), p„+i = (n -|- 1 , 1 ) and qn+i = (n -|- 1 , 0 ). (In 
terms of the graph G, this corresponds to adding new vertices n + 1 and [n + 1)' and new 
edges (n -|- 1, h'), (n, {n -|- 1)') and (n -|- 1, (n -|- 1)'), together with any further edges implied 
by monotonicity.) Integrate these new points into the path H as illustrated in Figure [121 
Add a token to g^+i and leave the existing tokens in place (including the one at the point 
formally known as pi, which now becomes dislocation d). Place the foci a and fd just below 
d and Pn+i- The invariant is now satisfied, and the arrangement of tokens within the original 
extent of the board is unchanged. We can now start the canonical path construction as 
described earlier. A similar construction (rotated through tt) works to hnish the path. 

Two important points to note. The new board is consistent with monotonicity. (In this 
context, recall that the edge (n, n') must be present in G, and hence the point (/i, 1) is a 
valid position for a token.) Moreover, even though pi has been placed to the left of in the 
hgure, the construction works equally well with pi to the right. 
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3.3 Encoding and congestion 


In Lemma UHl reverse the role of a and / 9 , so that a( 0 ) = a(l) = Pk+ii / 9 ( 0 ) = Pi 
and /S(l) = Qk- Place the n tokens initially on the points {gi,...,g„}, and denote the 
conhguration at time t by cr'{t). Since the trajectories of a{t) and / 5 (f) are oblivious of 
the tokens, Pl = and Pu = Pu{t) are unchanged. According to the invariant, the 

conhguration a' satishes a' nPu C {qi ,..., g„}, a' H Pl C ..., and \a' H P\ = n — 1 . 
At any legal time t, then, 

\{a{t)Ua'{t))nP\= 2 n- 2 . (2) 

Consider a canonical path X = Zq —)■ Zi —)■■■■—)■ = F constructed by the method of 

the Section 13.21 Some of Cases I-IV* involved making two switches; in these cases, call the 
middle conhguration between the two switches (and the corresponding perfect matching Zj) 
transitory. Note the conhgurations that are not transitory are of the form a{t) for some t, 
and these conhgurations satisfy Invariant I. If Zj is not transitory, consider a time t at which 
conhguration a{t) corresponds to Zj. Then cr'(f) is a near complement to a{t) with respect 
to P, and its corresponding perfect matching Z' is a near complement to Zj with respect to 
X UY; specihcally, from equation ([ 2 ]), | (Zj U Z') n (X U Z) | = 2 n — 2 . 

Suppose (Z, Z^) is a valid transition (switch) of the Markov chain. In the usual way, we 
wish to provide every canonical path through (Z, Z^) with a unique encoding, and hence 
bound the total number of paths using (Z, Z^). In fact, our encoding will be an element of 
n X [dn^]. So suppose the transition of interest occurs at (Zj,Zj_|_i) in a canonical path of 
the above form from X to Y. We suppose hrst that C = X U X is a single cycle. At least 
one of Zj and Zj+i is not transitory, say, Zj. Our encoding will be the near complement Z' 
together with some additional data. Consider C" = Zj U Z'. Noting that |C" \ O'! < 2 , the 
hrst piece of extra data we provide is the identity of the (at most) two edges in C" \ C* that 
need to be deleted from C in a bid to recover C\ there are 2 n^ possibilities. Now we need 
to add two edges, but there are only two ways this can be done so that the result is a cycle. 
Finally, we need to signal that the near complement was taken with respect to Zj and not 
Zj+i: a further two possibilities. This gives us our encoding within the set hi x [8n^]. 

In general, the symmetric diherence of X and X is a union of several cycles, say X 0 X = 
Cl U • • • U Cfc U • • • U Cs, where the cycles are presented in the agreed order, and Ck is the 
cycle currently being switched using the procedure described in Section To construct the 
perfect matching IX G that forms the main element of the encoding, proceed as follows. 
Except on the cycle Ck, the matching IX is given by the symmetric difference of X, X 
and Zj. Thus, denoting the vertex set of Ck by 14 , and the complement X \ 14 by 14 , we 
have IX[Xfc] = (X0X©Zj)[Xfc]. On Ck itself, IX[X^] is given by the single-cycle construction 
from Section [ 3 ^ The additional data is encoded by an integer in [Sn^] as before. Note that 
we can reconstruct the common edges X fl X of the initial and hnal states, and all the cycles 
of X0X except Ck, by examining Zj fllX and Zj0lX. Knowing that Ci,..., Ck-i have been 
switched and Ck+i, ■ ■ ■ ,Cs have not, we may then recover X[Xfc] and X[I 4 ]- Finally, we 
recover X[Xfc] and X[Xfc] as in the single-cycle case. Summarising, there are at most 8 n^|r 2 | 
canonical paths using any given transition. 

We have all the quantities needed for the calculation of the congestion g. From the dehnition 






of the switch chain, V{Z, Z^) = 2n From Lemma [181 fh® maximum length of a canonical 
path is Substituting these values into ([I]) yields 

Q < |fl|“^(n^/2)(8n^) |f2| = 4n®. 

Then, from Lemma [T71 noting that the state space has cardinality at most n!, we obtain 
the sought for bound on mixing time. 

Theorem 19. The switch Markov chain has mixing time T{e) < 8n^{nlnn + 21n£“^) = 
0(?7,^log?7,) for any graph G = ([n] U [n]',E) in the class Monotone. 

The big-0 estimate comes from the observation that, if we were to set e smaller than 1/n!, 
say, we could not distinguish between the output of the chain and the uniform distribution 
within polynomial time. By comparison with Theorem Uni the algorithm of |2] has running 
time O(n^log^n), with no bound given on the implied constant. It may be possible to 
improve the analysis here, but it is highly unlikely that we could match the O(n^logn) 
bound conjectured by Diaconis, Graham and Holmes [9]. Currently, this has not even been 
established for chain graphs. 


4 Conclusions 

The hierarchy of hereditary graph classes studied in this paper provides a framework for 
understanding the ergodicity and mixing time of the switch chain for perfect matchings. 
Unfortunately, while we have identihed the largest class for ergodicity, it seems difficult to 
characterise the largest hereditary class having polynomial time mixing. Indeed, it is not 
even clear that such a class exists. 

A simpler question would be to exhibit proper superclasses of monotone graphs for which the 
switch chain mixes rapidly. From the examples 9fc given in subsection 12.31 we see that such 
a superclass cannot contain all biconvex graphs. More precisely, it cannot contain for all 
k. Thus we might consider the class which excludes Sfc for some A;, and try to determine 
the largest k for which this class has rapid mixing. Note that the class excluding Sfc also 
excludes 9r for all i > k, since 9fc is an induced subgraph of 9£- 

The graph 92 is monotone. Therefore the class of 92-free biconvex graphs, that is, biconvex 
graphs that have no induced subgraph isomorphic to 92 , is not a superclass of monotone 
graphs. However, for every hxed k > 2 the class of all 9fc-free biconvex graphs does contain 
all monotone graphs. This can be proved by observing inclusions among the forbidden 
subgraphs in Figs. [2] and [5l Unfortunately, we know very little about the structure of the 
graphs in these classes, let alone whether their structure might support rapid mixing. 
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A Appendix: Proof of Lemma [18 

A.l The parallel mountain climbers problem 


Let P = {pi,p 2 ,... ,Pn) be an ordered sequence of points p, = (a:*, yi) G such that pi = Pn 
and Pi > Pi (i G [n]). We will assume that p* 7 ^ pj for i j, unless {i,j} = {1,^^}- 
Otherwise, we replace p* by p* + £* (1 < i < n), for small enough e: > 0. Then the line 
segments Li = (i G [n — 1]) will be called slopes. The range If is then the point 

set \JgLi Li C The set P C If will be called the nodes of the range. Observe that If 
is a connected one-dimensional manifold with boundary {pi,Pn}, in which the nodes of P 
appear in order. 

The height p : If ^ M (p G If) is the piecewise linear function dehned by interpolation on the 
Pi {i G [n]). Thus, ifp G Lj is such that p = Apj-I-(1 — A)pi+i, then p(p) = Apj-I-(1 — A)pj+i. A 
node Pi will be called a peak if it is a local maximum of p, a valley if it is a local minimum of p, 
and monotone otherwise. The summit ps of If is the highest peak, so Ps = max{p(p) : p G If}. 

We will say that two ranges If, If' are isomorphic if there is bijection between P and P' 
such that y'i < p' if and only if pi < pj. The bijection is extended linearly to the Li. If 
p = Xpi -l- (1 — A)pj+i G Li, then p' = Ap' + (1 — A)p'_,_]^ G L'. Hence, for any range H, there is 
an isomorphic range H' such that Pi = {i,yi — Pi) {i G [n]). For illustration, we will generally 
use this standard presentation of H. Then, for example, we will say a point ui G H is to the 
left or right of a point ^2 G H if it is true in the standard presentation. 

An example range with n = 11 is shown below, hrst in the standard presentation, and then 
in an isomorphic presentation. Here P3,P5,P7,Pio are peaks, Pi,P4,P6,P9,Pii are valleys, and 
P2,P8 are monotone. The summit is py. 



We wish to show the existence of continuous functions a, fd : [0,1] —)■ H, with Q!(0) = pi, 
/9(0) = Pn, a(l) = /S(l) = Ps, such that y{a{t)) = y{f3{t)) for all t G [0,1]. An event 
will be any value of t such that a{t) G P or /3{t) G P, and we will let T be the set of 
events. Our assumption that p* = pj only if i = j or {i,j} = {l,ri} implies that, for any 
Pi (1 < i < n), y{p) = Pi only if p is in the interior of some line segment Lj. This can be 
viewed as the intersection of a horizontal line through pi with the line segment Lj. If this 
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intersection exists, we will denote it by PiLj. The set of such intersections determines all 
potential events. It is a superset of T, and is clearly preserved under isomorphism. 

Note that, in the interval between two successive events, we may take a,j3 to be 

functions dehned by linear interpolation: 

^ {tj+i - + {t- tj)a{tj+i) ^ (tj+i - t)/3{tj) + {t- tj 

tj+i — tj — tj 

Thus a, (3 may be taken as piecewise linear functions. These functions clearly transform in 
an obvious way, under isomorphism. They are completely determined by the set T. Hence 
we wish to determine a tight upper bound on |T|. 

This is usually posed as the (discrete parallel) mountain climbers problem, and we will use its 
terminology here. (See, for example, [T5l|28l[3Tl|3^.) For example, we refer to the parameter 
t as “time”. Two climbers, starting at pi and pn, attempt to meet by moving continuously 
on the slopes of the standard presentation of the range H, so that they are always at the 
same height. Note that one climber must always remain to the left of the summit, and the 
other to the right, since if either reaches the summit, so must the other. Thus, if they meet, 
they must meet at Ps- Can they always meet and, if so, how many events are required to 
ensure that they meet? 

Note that (a version of) this problem hrst arose in analysis, as a problem concerning con¬ 
tinuous functions. (See, for example, [211 IMl |38].) This continuous version is somewhat 
different. There are pathological cases in which the climbers cannot meet and, even when 
they can, there may be no hnite bound on the number of events. 

There appears to be no worst-case analysis of the discrete version of this problem in the 
literature, although [15] gave an O(n^) upper bound on the number of events. Since this 
problem is central to the canonical path construction in this paper, we give an analysis here. 

A.2 Upper bound 

The range graph [3l] TZQ(J1) has a vertex for each intersection PiLj, with l<i<s<j<n 
or 1 < j < s < i < n. These are intersections from points to the left of the summit with 
slopes to the right, and vice versa, representing all possible events during the climb. We also 
add vertices for the initial and hnal states piPn and PsPs- It is easy to see that this vertex 
set is the same for isomorphic ranges and can have cardinality at most (n — l)(n — 2 ) -|- 2 , 
by simple counting. However, this bound can be improved. 

Lemma 20. There are at most {n — 1)^/4 -1- 1 vertices in TZQ(J1). 

Proof. Suppose there are ni slopes to the left of the summit, and n 2 to the right, so ni+n 2 = 
n — 1. Let N{ni,n 2 ) be the maximum number of vertices in 7^^(n). We will show by 
induction that N{ni,n 2 ) < nin 2 + 1. First, it is easy to see that A^(l,? 7 , 2 ) < n 2 -|- 1. 

Next, if any of the nodes from p 2 to Ps-i is monotone, consider the range given by removing 
such a node. Its range graph will have at most N{ni — l,n 2 ) < (ni — l)n 2 + 1 vertices, by 
induction. Adding back the removed node gives rise to at most n 2 new vertices, as shown 


)/?(tj+i) 
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below (in the case that the removed node is P 2 ), so N{ni, n 2 ) < (ni — l)n 2 +1 + ^2 = nin 2 +1 
vertices. 



Adding a leftmost monotone node 


Vertices arising from the new node 


Finally, if there are no monotone nodes and hence p 2 is a peak, consider the range given by 
removing both node p 2 and node ps, which is necessarily a valley. Its range graph will have 
N{ni — 2 ,n 2 ) < (ni — 2)n2 + 1 vertices, using the induction. Adding back p 2 and ps gives 
the situation shown below. 



Adding a new leftmost peak 



Vertices arising from the jth path 


The only new vertices will be in the window determined by the slope to the right of the 
new hrst peak, i.e., between p 2 and p^. The window partitions the range to the right of the 
summit into some number r of disjoint paths. Suppose that the jth path has n 2 ,j slopes 
(j = l,...,r), so X]I=i < ^ 2 - Then it is easy to see that the jth path gives rise to 
exactly 2n2,j new vertices. Thus the total number of vertices added is 2 n 2 j < 2n2. 
Thus iV(ni, n 2 ) < N{ni — 2, n 2 ) + 2 n 2 < {rii — 2)n2 + 1 + 2n2 = nin 2 + 1, and we need only 
observe that nin 2 has a maximum of (n — 1)^/4, when rii = n 2 = {n — l)/2. □ 

There is an edge between two vertices in 7^^ (11) if the ends of the two horizontal lines 
dehning the vertices are connected by slopes. Thus there is an edge between vertices PiLj 
and PkLr if either /c G {i — 1, i + 1} and j = r, or j E {k — 1, k} and r G {i — 1, i). The edges 
of 7^^(n) correspond to possible intervals between successive events. For example, p^L^ is 
adjacent to p^L^ in the range graph of the example range above, as illustrated. 
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It is clear that each vertex, other than piPn and PsPs^ is adjacent to exactly two other vertices, 
corresponding to moving left or right on the range at a peak or valley, and np or down from 
a monotone node. Thns all vertices in the graph have degree 2, apart from piPn and PsPs, 
which have degree 1, since it is only possible to move np from piPn, or down from PsPs- Hence 
the range graph comprises a nniqne path joining piPn to PsPs, and possibly some disjoint 
cycles. The nnmber of edges on this path is at most {n — 1)^/4. 

The fnll range graph for the example with n = 11 above has 22 vertices, and comprises a 
path and a single cycle, with 14 and 8 vertices, respectively: 

PiPii P2L10 PwL 2 P2Lg P9L1 P2LS P&L2 P10L3 PiLiQ PbLia 


P7P7 pgLg PqL^ P3L5 P 5 Lr PiLr P^Lj P4L9 p^Lg p^Lg Piglg 

The npper bonnd on the nnmber of vertices with n = 11 is 10^/4 + 1 = 26, So the 22- 
vertex range graph has fewer vertices than the npper bonnd, and the 14-vertex path has 
considerably fewer. 

This example illustrates that TZQiJl) need not be connected, and may have fewer vertices 
than the upper bound. In fact, the range graph may have as few as n — 1 vertices, for 
example when s = 2. There cannot be fewer, since there must be at least one vertex for each 
of the {n — 3) nodes other than pi, Pm plus the two vertices piPn, PsPs- 

Note also that the effect on TZQiJl) of removing the perturbations ej used to make heights 
unique, is to merge pairs of vertices PiLj_i and ptLj into a single vertex PiPj. The merged 
vertex may have degree 4 in the range graph, so the component of TZQiU) containing piPn 
and PsPs may contain cycles. In this case, the path from pip„ to PsPs iu TZQJl) may no 
longer be unique. See [3lj for an example. 

A.3 Alternative proof 

The proof given above is essentially that of Tucker [51]. Other authors (e.g. [15]) dehne 
the range graph to include all intersections PiLj, and even all “intersections” PiPi, but use 
essentially the same proof. Unfortunately, the range graph proof does not provide a lower 
bound on the worst case, since the component structure of TZQJl) is opaque. To do this, we 
give an equally easy proof of the upper bound by induction on n. If n = 3, there is a single 
peak, and nothing to prove. Otherwise, we divide the problem into three stages. Assume, 
without loss of generality, that the second highest peak pr is to the left of the summit Ps, 
so r < s. Let the deepest valley between pr and ps be pt, so r < t < s. We construct the 
horizontal line Si from pr to the slope Lg to the right of ps, and the horizontal line S2 from 
Pt to the hrst slope encountered to its right. This produces three subproblems, climbing up 
from piPn to summit Si, climbing down from Si to summit S2, and then climbing up from 
S2 to the true summit PsPs- By “climbing down” we mean replacing the height function y 
with —y. 
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Each stage has a range with fewer nodes than the original problem, so each has a path to its 
summit, by induction. Joining these paths together gives a path for the original problem. 
Note that the summits Si and S 2 in stages 1 and 2 must be shrunk to a point so that the 
ranges correspond to the dehnition, but this does not affect the path. 

The uniqueness of the path is less easy to establish using this proof. However, we have 
already proved uniqueness, so we will not consider this further. 

A.4 Lower bound 

If n = 2A; + 1 > 3, range A*, has pi = (0, 0), p 2 k+i = (2A;, 0), k peaks pi at (i, min{A; + i — 
1, 3A; — i + 2}) {i = 2,4,..., 2/c — 2), and k — 1 valleys Pi at (i, max{/c — i + 2,i — /c — 1}) 
(i = 1, 3,..., 2/c — 1). If /c is odd, Pk = {k + 1, 2k) is the summit, and Pk +2 = {k + 2,2k — 1) is 
the second highest peak. If k is even, Pk +2 = {k + 2, 2k) is the summit, and pk = {k, 2k — 1) 
is the second highest peak. Thus s = /c + 1 if /c is odd, and s = k + 2 if k is even. The case 
n = 21 is shown below, together with the three ranges constructed in the inductive proof 
above. 
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Stage 2 Stage 3 

The range of stage 1 can be transformed to Afc_i by minor changes of coordinates. Clearly 
stages 2 and 3 each contribute {k — 1) edges to the path in 7lQ{Ak). Thus, if Tk is the number 
of edges in the path from poP 2 k to PsPs in 7lQ{Ak), we have Ti = 1 , and = Tfe_i + 2 (/c — 1 ), 
from which we can easily deduce = k{k — 1) + 1 = (n — l)(n — 2)/4 + 1. The path from 
PiPn to PsPs is unique, and has path length (n — l)(n — 2)/4 in 7lQ{Ak). Observe that this 
closely matches our upper bound (n — 1)^/4. 

We can compute the number of vertices in TiQ{Ak) exactly. If k is even (resp. odd), the 
total number of horizontal intersections from peaks to the left (resp. right) of ps with slopes 
to its right (resp. left) is 

(k — 1) + (/c — 3) + (A; — 5) + ■ ■ ■ + 3 + 1. 

The total number of horizontal intersections from valleys to the left (resp. right ) of Ps with 
slopes to its right (resp. left) is 

(A; - 2) + (A; - 4) + (A; - 6) + ■ ■ ■ + 4 + 2 , 
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These total k{k — l)/2, whether k is odd or even. The total number of horizontal intersections 
from peaks to the right (resp. left) of Ps with slopes to its left (resp. right) is 

(k — 1) + (/c — 3) + (fc — 5) + ■ • • + 3 + 1, 

and the total number of horizontal intersections from valleys to the right (resp. left) of Ps 
with slopes to its left (resp. right) is 

(fc - 2) + (A; - 4) + (fc - 6) + ■ • • + 4 + 2 . 

These also total k{k — l)/2. So the number of vertices in TZQ{Kk) is k{k — 1) + 2. Hence 
7lQ{Ak) is a single path from piPn to PsPs- 

To summarise, we have proved 

Theorem 21. Let R{n) be the maximum possible number of events required for two parallel 
climbers to meet at the summit on a range H with n nodes. Then R{n) = n^/4 — 0{n). 

From this we may deduce 

Lemma 18. Suppose H is as above. There are continuous functions a, (5 : [0,1] —?■ H 
satisfying a(0) = pi, a(l) = Qk, /3(0) = /1(1) = pk+i, and y{a{t)) = y{f3{t)) for all 

t G [0,1]. Moreover the set of events 

T = {t G [0,1] : a{t) e P or f3{t) G P} 

has cardinality at most . 

Proof, n is a range with 2n nodes {pi, gi,p2, ?2, • • • ,Pn, ?n}- Hence 7^^(n) has at most 
(2n)^/4 = vertices. Observe that this bound cannot be signihcantly improved in the 
worst case. □ 
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